Ye Tian

ProAct: A Benchmark and Multimodal Framework for Structure-Aware Proactive Response

Feb 03, 2026, via arXiv

ClueTracer: Question-to-Vision Clue Tracing for Training-Free Hallucination Suppression in Multimodal Reasoning

Feb 02, 2026, via arXiv

SAMTok: Representing Any Mask with Two Words

Jan 22, 2026, via arXiv

LifeAgentBench: A Multi-dimensional Benchmark and Agent for Personal Health Assistants in Digital Health

Jan 20, 2026, via arXiv

An Fluid Antenna Array-Enabled DOA Estimation Method: End-Fire Effect Suppression

Dec 22, 2025, via arXiv

ESearch-R1: Learning Cost-Aware MLLM Agents for Interactive Embodied Search via Reinforcement Learning

Dec 21, 2025, via arXiv

CoPHo: Classifier-guided Conditional Topology Generation with Persistent Homology

Dec 17, 2025, via arXiv

MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation

Nov 18, 2025, via arXiv

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Oct 22, 2025, via arXiv

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Sep 08, 2025, via arXiv